Computer Vesion Based Music Information Retrieval
نویسندگان
چکیده
This project relies on computer vision techniques to build a practical music retrieval system. Our approach tackles traditional music identification as a corrupted sub-image retrieval problem from the 2-D spectrogram representation of the original songs. More specifically, a query snippet spectrogram is matched against our database using its descriptor representation. We utilize a novel pairwise boosting method to learn a robust and compact subset of Viola-Jones filters. This subset of filters captures distinctive information of each spectrogram while being resistant to distortion. By applying these filters, we can transform a spectrogram into a low-dimension binary representation. During the query phase, we search for all the candidates that locally match the descriptors of the query snippet. Then, we select the most likely candidate using an occlusion model based on a 2-state HMM. In order to test the performance of our music retrieval system, we rely on a case study: the Reggae music genre of the iTunes Store. The recognition is both fast and accurate, even with noisy background and a short song snipped as query. We believe that this project can have a significant impact on the way people organize and research music. Indeed, this system tags songs automatically, adding music meta data such as album, artist, genre, and art cover. Furthermore, this project also opens up new avenues for live music search and identification.
منابع مشابه
Prototyping a Vibrato-Aware Query-By-Humming (QBH) Music Information Retrieval System for Mobile Communication Devices: Case of Chromatic Harmonica
Background and Aim: The current research aims at prototyping query-by-humming music information retrieval systems for smart phones. Methods: This multi-method research follows simulation technique from mixed models of the operations research methodology, and the documentary research method, simultaneously. Two chromatic harmonica albums comprised the research population. To achieve the purpose ...
متن کاملMeasuring Information in Jazz Improvisation
Scott J. Simon School of Library and Information Science, University of South Florida, 4202 E. Fowler Ave, CIS1040, Tampa, FL 33620-7800. Email:[email protected]. Phone: 813.974.3521. Fax: 813.974.6840 The following interdisciplinary research explores the concept of entropy in jazz improvisation and demonstrates a method for measuring it utilizing standard music notation. Formulas are adapted ...
متن کاملA Scalable Peer-to-Peer System for Music Information Retrieval
Computer Music Journal George Tzanetakis,* Jun Gao,† and Peter Steenkiste†‡ *Computer Science Department, Faculty of Engineering University of Victoria PO BOX 3055 STNCSC Victoria, British Columbia V8W3P6 Canada [email protected] †Computer Science Department ‡Department of Electrical and Computer Engineering Carnegie Mellon University 5000 Forbes Avenue Pittsburgh, Pennsylvania 15213 USA {jungao...
متن کاملVisualization in Audio-Based Music Information Retrieval
Pampalk, and George Tzanetakis *FX Palo Alto Laboratory 3400 Hillview Avenue, Building 4 Palo Alto, California 94304 USA {foote, cooper}@fxpal.com †Austrian Research Institute for Artificial Intelligence (OFAI) Freyung 6/6 A-1010 Vienna, Austria [email protected] ††Department of Computer Science (also Music) University of Victoria PO Box 3055, STN CSC Victoria, British Columbia V8W 3P6 Cana...
متن کاملImproved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010